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CLAIMS 

1 . A device control device comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
specifies words represented by the speech by performing speech recognition on the speech data; 
5 specifying means (7) which specifies a content of the speech uttered by an utterer based 

on the words specified by the speech recognition means (2); and 

process execution means (7) which specifies a content of control to be performed on an 
external device to be a control target based on the specified content, and performs the control. 

2. The device control device according to claim 1, wherein the speech recognition means 
1 0 (2, 3) includes speech part specifying means (3) which specifies a part of speech of the specified 

words, and 

the specifying means (7) specifies a content of the speech uttered by the utterer based 
only on those of the words specified by the speech recognition means (2) which are specified as a 
predetermined part of speech. 
15 3. The device control device according to claim 2, wherein the specifying means (7) 

discriminates whether or not a combination of a plurality of words in the words specified by the 
speech recognition means (2, 3) which is specified as a predetermined part of speech meets a 
predetermined condition, and specifies a content of the speech uttered by the utterer based on a 
discrimination result. 

2 0 4. The device control device according to claim 1, wherein the specifying means (7) 

holds information which associates words with one or more categories, and specifies a content of 
the speech uttered by the utterer based on a category in which the words specified by the speech 
recognition means (2, 3) are classified. 

5. The device control device according to claim 1, wherein the specifying means (7) 

2 5 holds correlation information which associates words of different meanings or different categories 
with each process of the process execution means (7), and specifies a content of the speech 
uttered by the utterer based on a combination of those words or categories which are specified by 
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the speech recognition means (2, 3), and the correlation information. 

6. The device control device according to claim 1, wherein the specifying means (7) 
holds information which associates words with one or more categories, and specifies a content of 
the speech uttered by the utterer based on a category in which a plurality of words specified by 

5 the speech recognition means (2, 3) are commonly classified. 

7. The device control device according to claim 1, wherein the specifying means (7) 
holds a plurality of words assigned to respective processes of the process execution means (7), 
and performs a corresponding process when at least one of the words specified by the speech 
recognition means (2, 3) is a word assigned to the process. 

10 8. The device control device according to claim 1, wherein when a meaning of an input 

speech is not discriminatable, the specifying means (7) prompts an input in a more 
discriminatable expression. 

9. The device control device according to claim 1, further comprising information 
acquisition means (7) which acquires information from an external device, and 

1 5 wherein the specifying means (7) selects an output content to be output based on the 

information acquired by the information acquisition means (7). 

10. A device control device comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
specifies words represented by the speech by performing speech recognition on the speech data; 
2 0 specifying means (7) which specifies a content of the speech uttered by an utterer based 

on the words specified by the speech recognition means (2, 3); 

process specifying means (7) which specifies a content of control to be performed on an 
external device to be a control target based on the specified content; 

information acquisition means (7) which acquires information via predetermined 
2 5 communication means (7); and 

speech output means (5) which outputs a speech based on the information acquired by 
the information acquisition means (7), 
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whereby when the control specified by the process specifying means (7) is to output 
information acquired by the information acquisition means (7), the speech output means (5) 
outputs a speech based on the information. 

11. A speech recognition device comprising: 

5 speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); and 

process execution means (7) which specifies a process to be performed based on the 
1 0 specified content, and performs the process. 

12. The speech recognition device according to claim 11, wherein the speech recognition 
means (2, 3) includes speech part specifying means (3) which specifies a part of speech of the 
specified words, and 

the specifying means (7) specifies a content of the speech uttered by the utterer based 
1 5 only on those of the words specified by the speech recognition means (2, 3) which are specified 
as a predetermined part of speech. 

13. The speech recognition device according to claim 11, wherein the specifying means 
(7) holds information which associates words with one or more categories, and specifies a content 
of the speech uttered by the utterer based on a category in which the words specified by the 

2 0 speech recognition means are classified. 

14. The speech recognition device according to claim 11, wherein the specifying means 
(7) holds correlation information which associates words of different meanings or different 
categories with each process of the process execution means (7), and specifies a content of the 
speech uttered by the utterer based on a combination of those words or categories which are 

2 5 specified by the speech recognition means, and the correlation information. 

15. The speech recognition device according to claim 11, wherein the specifying means 
(7) holds information which associates words with one or more categories, and specifies a content 
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of the speech uttered by the utterer based on a category in which a plurality of words specified by 
the speech recognition means are commonly classified. 

16. The speech recognition device according to claim 11, wherein the specifying means 
(7) holds a plurality of words assigned to respective processes of the process execution means (7), 

5 and performs a corresponding process when at least one of the words specified by the speech 
recognition means (2, 3) is a word assigned to the process. 

17. The speech recognition device according to claim 11, wherein when a meaning of an 
input speech is not discriminatable, the specifying means (7) prompts an input in a more 
discriminatable expression. 

10 18. The speech recognition device according to claim 11, further comprising information 

acquisition means (7) which acquires information from an external device, and 

wherein the specifying means (7) selects an output content to be output based on the 
information acquired by the information acquisition means (7). 
19. A speech recognition device comprising: 
1 5 speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); 

process specifying means (7) which specifies a process to be performed based on the 
2 0 specified content; 

information acquisition means (7) which acquires information via predetermined 
communication means (74); and 

speech output means (5) which outputs a speech based on the information acquired by 
the information acquisition means (7), 
2 5 whereby when the process specified by the process specifying means (7) is to output 

information acquired by the information acquisition means (7), the speech output means (5) 
outputs a speech based on the information. 
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20. An agent device comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
5 on the words specified by the speech recognition means (2, 3); and 

process execution means (7) which specifies a process to be performed based on the 
specified content, and performs the process. 

21. The agent device according to claim 20, wherein the speech recognition means (2, 3) 
includes speech part specifying means (3) which specifies a part of speech of the specified words, 

10 and 

the specifying means (7) specifies a content of the speech uttered by the utterer based 
only on those of the words specified by the speech recognition means (2, 3) which are specified 
as a predetermined part of speech. 

22. The agent device according to claim 21, wherein the specifying means (7) 

1 5 discriminates whether or not a combination of a plurality of words in the words specified by the 
speech recognition means (2, 3) which is specified as a predetermined part of speech meets a 
predetermined condition, and specifies a content of the speech uttered by the utterer based on a 
discrimination result. 

23. The agent device according to claim 20, wherein the specifying means (7) holds 

2 0 information which associates words with one or more categories, and specifies a content of the 
speech uttered by the utterer based on a category in which the words specified by the speech 
recognition means are classified. 

24. The agent device according to claim 20, wherein the specifying means (7) holds 
correlation information which associates words of different meanings or different categories with 

2 5 each process of the process execution means (7), and specifies a content of the speech uttered by 
the utterer based on a combination of those words or categories which are specified by the speech 
recognition means (2, 3), and the correlation information. 



25. The agent device according to claim 20, wherein the specifying means (7) holds 
information which associates words with one or more categories, and specifies a content of the 
speech uttered by the utterer based on a category in which a plurality of words specified by the 
speech recognition means (2, 3) are commonly classified. 
5 26. The agent device according to claim 20, wherein the specifying means (7) holds a 

plurality of words assigned to respective processes of the process execution means (7), and 
performs a corresponding process when at least one of the words specified by the speech 
recognition means (2, 3) is a word assigned to the process. 

27. The agent device according to claim 20, wherein when a meaning of an input speech 
10 is not discriminatable, the specifying means (7) prompts an input in a more discriminatable 

expression. 

28. The agent device according to claim 20, further comprising information acquisition 
means (7) which acquires information from an external device, and 

wherein the specifying means (7) selects an output content to be output based on the 
1 5 information acquired by the information acquisition means (7). 

29. The agent device according to claim 20, wherein the specifying means (7) includes 
means which, when the process specified as a process to be performed is a process of presenting 
information externally received to the utterer, performs the presentation by generating a speech 
which reads out the information. 

2 0 30. An agent device comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); 
2 5 process specifying means (7) which specifies a process to be performed based on the 

specified content; 

information acquisition means (7) which acquires information via predetermined 
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communication means (74); and 

speech output means (5) which outputs a speech based on the information acquired by 
the information acquisition means (7), 

whereby when the process specified by the process specifying means (7) is to output 
5 information acquired by the information acquisition means (7), the speech output means (5) 
outputs a speech based on the information. 

31. An on- vehicle device control device so constructed as to be mountable on a vehicle 
having an external on-vehicle device mounted thereon, comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
1 0 specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); and 

process execution means (7) which specifies a content of control to be performed on the 
on-vehicle device based on the specified content, and performs the control. 
1 5 32. The on-vehicle device control device according to claim 31, wherein the speech 

recognition means (2, 3) includes speech part specifying means (3) which specifies a part of 
speech of the specified words, and 

the specifying means (7) specifies a content of the speech uttered by the utterer based 
only on those of the words specified by the speech recognition means (2, 3) which are specified 
2 0 as a predetermined part of speech. 

33. The on-vehicle device control device according to claim 31, wherein the specifying 
means (7) holds information which associates words with one or more categories, and specifies a 
content of the speech uttered by the utterer based on a category in which the words specified by 
the speech recognition means (2, 3) are classified. 
2 5 34. The on-vehicle device control device according to claim 31, wherein the specifying 

means (7) holds correlation information which associates words of different meanings or different 
categories with each process of the process execution means (7), and specifies a content of the 
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speech uttered by the utterer based on a combination of those words or categories which are 
specified by the speech recognition means, and the correlation information. 

35. The on-vehicle device control device according to claim 31, wherein the specifying 
means (7) holds information which associates words with one or more categories, and specifies a 

5 content of the speech uttered by the utterer based on a category in which a plurality of words 
specified by the speech recognition means (2, 3) are commonly classified. 

36. The on-vehicle device control device according to claim 31, wherein the specifying 
means (7) holds a plurality of words assigned to respective processes of the process execution 
means (7), and performs a corresponding process when at least one of the words specified by the 

1 0 speech recognition means (2, 3) is a word assigned to the process. 

37. The on-vehicle device control device according to claim 31, wherein when a 
meaning of an input speech is not discriminatable, the specifying means (7) prompts an input in a 
more discriminatable expression. 

38. The on-vehicle device control device according to claim 31, further comprising 
1 5 information acquisition means (7) which acquires information from an external device, and 

wherein the specifying means (7) selects an output content to be output based on the 
information acquired by the information acquisition means (7). 

39. An on-vehicle device control device so constructed as to be mountable on a vehicle 
having an external on-vehicle device mounted thereon, comprising: 

2 0 speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); 

process specifying means (7) which specifies a content of control to be performed on the 
2 5 on-vehicle device control device based on the specified content; 

information acquisition means (7) which acquires information via predetermined 
communication means (74); and 
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speech output means (5) which outputs a speech based on the information acquired by 
the information acquisition means (7), 

whereby when the control specified by the process specifying means (7) is to output 
information acquired by the information acquisition means (7), the speech output means (5) 
5 outputs a speech based on the information. 

40. A navigation device so constructed as to be mountable on a vehicle, comprising: 
speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 
specifying means (7) which specifies a content of the speech uttered by an utterer based 
10 on the words specified by the speech recognition means (2, 3); and 

process execution means (7) which specifies a content of a navigation process to be 
performed based on the specified content, and performs the navigation process. 

41. The navigation device according to claim 40, wherein the speech recognition means 
(2, 3) includes speech part specifying means (3) which specifies a part of speech of the specified 

1 5 words, and 

the specifying means (7) specifies a content of the speech uttered by the utterer based 
only on those of the words specified by the speech recognition means (2, 3) which are specified 
as a predetermined part of speech. 

42. The navigation device according to claim 40, wherein the specifying means (7) holds 
2 0 information which associates words with one or more categories, and specifies a content of the 

speech uttered by the utterer based on a category in which the words specified by the speech 
recognition means (2, 3) are classified. 

43. The navigation device according to claim 40, wherein the specifying means (7) holds 
correlation information which associates words of different meanings or different categories with 

2 5 each process of the process execution means (7), and specifies a content of the speech uttered by 
the utterer based on a combination of those words or categories which are specified by the speech 
recognition means (2, 3), and the correlation information. 
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44. The navigation device according to claim 40, wherein the specifying means (7) holds 
information which associates words with one or more categories, and specifies a content of the 
speech uttered by the utterer based on a category in which a plurality of words specified by the 
speech recognition means (2, 3) are commonly classified. 
5 45. The navigation device according to claim 40, wherein the specifying means (7) holds 

a plurality of words assigned to respective processes of the process execution means (7), and 
performs a corresponding process when at least one of the words specified by the speech 
recognition means (2, 3) is a word assigned to the process. 

46. The navigation device according to claim 40, wherein when a meaning of an input 
1 0 speech is not discriminatable, the specifying means (7) prompts an input in a more 

discriminatable expression. 

47. The navigation device according to claim 40, further comprising information 
acquisition means (7) which acquires information from an external device, and 

wherein the specifying means (7) selects an output content to be output based on the 
1 5 information acquired by the information acquisition means (7). 

48. A navigation device so constructed as to be mountable on a vehicle, comprising: 
speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
2 0 on the words specified by the speech recognition means (2, 3); 

process specifying means (7) which specifies a content of a navigation process to be 
performed based on the specified content; 

information acquisition means (7) which acquires information via predetermined 
communication means (7); and 
2 5 speech output means (5) which outputs a speech based on the information acquired by 

the information acquisition means (7), 

whereby when the navigation process specified by the process specifying means (7) is to 



output information acquired by the information acquisition means (7), the speech output means 
outputs a speech based on the information. 

49. An audio device comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
5 specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); and 

process execution means (7) which specifies a content of a speech process to be 
performed based on the specified content, and performs the speech process, or controls an 
1 0 external device in such a way as to cause the external device to perform the speech process. 

50. The audio device according to claim 49, wherein the speech recognition means (2, 3) 
includes speech part specifying means (3) which specifies a part of speech of the specified words, 
and 

the specifying means (7) specifies a content of the speech uttered by the utterer based 
1 5 only on those of the words specified by the speech recognition means (2, 3) which are specified 
as a predetermined part of speech. 

51. The audio device according to claim 49, wherein the specifying means (7) holds 
information which associates words with one or more categories, and specifies a content of the 
speech uttered by the utterer based on a category in which the words specified by the speech 

2 0 recognition means (2, 3) are classified. 

52. The audio device according to claim 49, wherein the specifying means (7) holds 
correlation information which associates words of different meanings or different categories with 
each process of the process execution means (7), and specifies a content of the speech uttered by 
the utterer based on a combination of those words or categories which are specified by the speech 

2 5 recognition means (2, 3), and the correlation information. 

53. The audio device according to claim 49, wherein the specifying means (7) holds 
information which associates words with one or more categories, and specifies a content of the 
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speech uttered by the utterer based on a category in which a plurality of words specified by the 
speech recognition means (2, 3) are commonly classified. 

54. The audio device according to claim 49, wherein the specifying means (7) holds a 
plurality of words assigned to respective processes of the process execution means (7), and 

5 performs a corresponding process when at least one of the words specified by the speech 
recognition means (2, 3) is a word assigned to the process. 

55. The audio device according to claim 49, wherein when a meaning of an input speech 
is not discriminatable, the specifying means (7) prompts an input in a more discriminatable 
expression. 

1 0 56. The audio device according to claim 49, further comprising information acquisition 

means (7) which acquires information from an external device, and 

wherein the specifying means (7) selects an output content to be output based on the 
information acquired by the information acquisition means (7). 
57. An audio device comprising: 
1 5 speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means which specifies a content of the speech uttered by an utterer based on 
the words specified by the speech recognition means (2, 3); 

process specifying means (7) which specifies a content of a speech process to be 
2 0 performed based on the specified content; 

information acquisition means (7) which acquires information via predetermined 
communication means (7); and 

speech output means (5) which outputs a speech based on the information acquired by 
the information acquisition means (7), 
2 5 whereby when the speech process specified by the process specifying means (7) is to 

output information acquired by the information acquisition means (7), the speech output means 
(5) outputs a speech based on the information. 
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58. A device control method comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step; and 

a process execution step of specifying a content of control to be performed on an external 
device to be a control target based on the specified content, and performing the control. 

59. A device control method comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step; 

a process specifying step of specifying a content of control to be performed on an 
external device to be a control target based on the specified content; 

an information acquisition step of acquiring information via a predetermined 
communication device; and 

a speech output step of outputting a speech based on the information acquired in the 
information acquisition step, 

whereby when the control specified in the process specifying step is to output 
information acquired in the information acquisition step, a speech is output based on the 
information in the speech output step. 

60. A speech recognition method comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step; and 

a process execution step of specifying specifies a process to be performed based on the 
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specified content, and performing the process. 

61 . A speech recognition method comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step; 

a process specifying step of specifying a process to be performed based on the specified 

content; 

an information acquisition step of acquiring information via a predetermined 
communication device; and 

a speech output step of outputting a speech based on the information acquired in the 
information acquisition step, 

whereby when the process specified in the process specifying step is to output 
information acquired in the information acquisition step, a speech is output based on the 
information in the speech output step. 

62. An agent processing method comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying specifies a content of the speech uttered by an utterer 
based on the words specified in the speech recognition step; and 

a process execution step of specifying a process to be performed based on the specified 
content, and performing the process. 

63. An agent processing method comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step; 



a process specifying step of specifying a process to be performed based on the specified 

content; 

an information acquisition step of acquiring information via a predetermined 
communication device; and 
5 a speech output step of outputting a speech based on the information acquired in the 

information acquisition step, 

whereby when the process specified in the process specifying step is to output 
information acquired in the information acquisition step, a speech is output based on the 
information in the speech output step. 
10 64. An on- vehicle device control method for controlling an on- vehicle device mounted 

on a vehicle, comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
1 5 words specified in the speech recognition step; and 

a process execution step of specifying a content of control to be performed on the on- 
vehicle device based on the specified content, and performing the control. 

65. An on- vehicle device control method for controlling an on- vehicle device mounted 
on a vehicle, comprising: 

20 a speech recognition step of acquiring speech data representing a speech and specifies 

words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step: 

a process specifying step of specifying a content of control to be performed on the on- 
2 5 vehicle device control device based on the specified content; 

an information acquisition step of acquiring information via a predetermined 
communication device; and 
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a speech output step of outputting a speech based on the information acquired in the 
information acquisition step, 

whereby when the control specified in the process specifying step is to output 
information acquired in the information acquisition step, a speech is output based on the 
information in the speech output step. 

66. A navigation method comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step; and 

a process execution step of specifying a content of a navigation process to be performed 
based on the specified content, and performing the navigation process. 

67. A navigation method comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step; 

a process specifying step of specifying a content of a navigation process to be performed 
based on the specified content; 

an information acquisition step of acquiring information via a predetermined 
communication device; and 

a speech output step of outputting a speech based on the information acquired in the 
information acquisition step, 

whereby when the navigation process specified in the process specifying step is to output 
information acquired in the information acquisition step, a speech is output based on the 
information in the speech output. 

68. An audio device control method comprising: 
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a speech recognition step of acquiring speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step; and 
5 a process execution step of specifying a content of a speech process to be performed 

based on the specified content, and performing the speech process, or controlling an external 
audio device in such a way as to cause the external device to perform the speech process. 

69. An audio device control method comprising: 

a speech recognition step of acquiring speech data representing a speech and specifies 
1 0 words represented by the speech by performing speech recognition on the speech data; 

a specifying step of specifying a content of the speech uttered by an utterer based on the 
words specified in the speech recognition step; 

a process specifying step of specifying a content of a speech process to be performed by 
an external audio device based on the specified content; 
15 an information acquisition step of acquiring information via a predetermined 

communication device; and 

a speech output step of outputting a speech based on the information acquired in the 
information acquisition step, 

whereby when the speech process specified in the process specifying step is to output 
2 0 information acquired in the information acquisition step, a speech is output based on the 
information in the speech output step. 

70. A program which allows a computer to function as a device control device 
comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
2 5 specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); and 
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process execution means (7) which specifies a content of control to be performed on an 
external device to be a control target based on the specified content, and performs the control. 

71. A program which allows a computer to function as a device control device 
comprising: 

5 speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means which specifies a content of the speech uttered by an utterer based on 
the words specified by the speech recognition means (2, 3); 

process specifying means (7) which specifies a content of control to be performed on an 
1 0 external device to be a control target based on the specified content; 

information acquisition means (7) which acquires information via predetermined 
communication means (74); and 

speech output means (5) which outputs a speech based on the information acquired by 
the information acquisition means (7), 
1 5 whereby when the control specified by the process specifying means (7) is to output 

information acquired by the information acquisition means (7), the speech output means (5) 
outputs a speech based on the information. 

72. A program which allows a computer to function as a speech recognition device 
comprising: 

2 0 speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means which specifies a content of the speech uttered by an utterer based on 
the words specified by the speech recognition means (2, 3); and 

process execution means (7) which specifies a process to be performed based on the 
2 5 specified content, and performs the process. 

73. A program which allows a computer to function as a speech recognition device 
comprising: 
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speech recognition means (2, 3) which acquires speech data representing a speech and 
specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); 
5 process specifying means (7) which specifies a process to be performed based on the 

specified content; 

information acquisition means (7) which acquires information via predetermined 
communication means (74); and 

speech output means (5) which outputs a speech based on the information acquired by 
1 0 the information acquisition means (7), 

whereby when the process specified by the process specifying means (7) is to output 
information acquired by the information acquisition means (7), the speech output means (5) 
outputs a speech based on the information. 

74. A program which allows a computer to function as an agent device comprising: 

1 5 speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); and 

process execution means (7) which specifies a process to be performed based on the 
2 0 specified content, and performs the process. 

75. A program which allows a computer to function as an agent device comprising: 
speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
2 5 on the words specified by the speech recognition means (2, 3); 

process specifying means (7) which specifies a process to be performed based on the 
specified content; 
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information acquisition means (7) which acquires information via predetermined 
communication means (74); and 

speech output means (5) which outputs a speech based on the information acquired by 
the information acquisition means (7), 
5 whereby when the process specified by the process specifying means (7) is to output 

information acquired by the information acquisition means (7), the speech output means (5) 
outputs a speech based on the information. 

76. A program which allows a computer to function as an on- vehicle device control 
device so constructed as to be mountable on a vehicle having an external on- vehicle device 

1 0 mounted thereon, comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); and 
1 5 process execution means (7) which specifies a content of control to be performed on the 

on-vehicle device based on the specified content, and performs the control. 

77. A program which allows a computer to function as an on-vehicle device control 
device so constructed as to be mountable on a vehicle having an external on-vehicle device 
mounted thereon, comprising: 

2 0 speech recognition means (2, 3) which acquires speech data representing a speech and 

specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); 

process specifying means which specifies a content of control to be performed on the on- 
2 5 vehicle device control device based on the specified content; 

information acquisition means (7) which acquires information via predetermined 
communication means (74); and 
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speech output means (5) which outputs a speech based on the information acquired by 
the information acquisition means (7), 

whereby when the control specified by the process specifying means (7) is to output 
information acquired by the information acquisition means (7), the speech output means (5) 
5 outputs a speech based on the information. 

78. A program which allows a computer to function as a navigation device so 
constructed as to be mountable on a vehicle, comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
specifies words represented by the speech by performing speech recognition on the speech data; 
1 0 specifying means (7) which specifies a content of the speech uttered by an utterer based 

on the words specified by the speech recognition means (2, 3); and 

process execution means (7) which specifies a content of a navigation process to be 
performed based on the specified content, and performs the navigation process. 

79. A program which allows a computer to function as a navigation device so 
1 5 constructed as to be mountable on a vehicle, comprising: 

speech recognition means which acquires speech data representing a speech and specifies 
words represented by the speech by performing speech recognition on the speech data; 

specifying means which specifies a content of the speech uttered by an utterer based on 
the words specified by the speech recognition means (2, 3); 
2 0 process specifying means which specifies a content of a navigation process to be 

performed based on the specified content; 

information acquisition means (7) which acquires information via predetermined 
communication means (74); and 

speech output means (5) which outputs a speech based on the information acquired by 
2 5 the information acquisition means (7), 

whereby when the navigation process specified by the process specifying means (7) is to 
output information acquired by the information acquisition means (7), the speech output means 
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outputs a speech based on the information. 

80. A program which allows a computer to function as an audio device comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
specifies words represented by the speech by performing speech recognition on the speech data; 
5 specifying means (7) which specifies a content of the speech uttered by an utterer based 

on the words specified by the speech recognition means (2, 3); and 

process execution means (7) which specifies a content of a speech process to be 
performed based on the specified content, and performs the speech process, or controls an 
external device in such a way as to cause the external device to perform the speech process. 
10 81. A program which allows a computer to function as an audio device comprising: 

speech recognition means (2, 3) which acquires speech data representing a speech and 
specifies words represented by the speech by performing speech recognition on the speech data; 

specifying means (7) which specifies a content of the speech uttered by an utterer based 
on the words specified by the speech recognition means (2, 3); 
1 5 process specifying means (7) which specifies a content of a speech process to be 

performed based on the specified content; 

information acquisition means (7) which acquires information via predetermined 
communication means (7); and 

speech output means (5) which outputs a speech based on the information acquired by 
2 0 the information acquisition means (7), 

whereby when the speech process specified by the process specifying means (7) is to 
output information acquired by the information acquisition means (7), the speech output means 
(5) outputs a speech based on the information. 



